NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Separate the Wheat from the Chaff: Winnowing Down Divergent Views in Retrieval Augmented Generation

https://doi.org/10.18653/v1/2025.emnlp-main.587

Wang, Song; Chen, Zihan; Wang, Peng; Wei, Zhepei; Tan, Zhen; Meng, Yu; Shen, Cong; Li, Jundong (November 2025, Association for Computational Linguistics)

Full Text Available
RayZer: A Self-supervised Large View Synthesis Model

Jiang, Hanwen; Tan, Hao; Wang, Peng; Jin, Haian; Zhao, Yue; Bi, Sai; Zhang, Kai; Luan, Fujun; Sunkavalli, Kalyan; Huang, Qixing; et al (October 2025, IEEE/CVF, International Conference on Computer Vision)

Full Text Available
Attention-Only Transformers via Unrolled Subspace Denoising

Wang, Peng; Lu, Yifu; Yu, Yaodong; Pai, Druv; Qu, Qing; Ma, Yi (May 2025, International Conference on Machine Learning)

Full Text Available
Ampere-level co-electrosynthesis of formate from CO2 reduction paired with formaldehyde dehydrogenation reactions

https://doi.org/10.1038/s41467-025-60008-9

Li, Zhengyuan; Wang, Peng; Han, Guanqun; Yang, Shize; Roy, Soumyabrata; Xiang, Shuting; Jimenez, Juan D; Kondapalli, Vamsi_Krishna Reddy; Lyu, Xiang; Li, Jianlin; et al (December 2025, Nature Communications)

Full Text Available
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models

Wang, Song; Wang, Peng; Zhou, Tong; Dong, Yushun; Tan, Zhen; Li, Jundong (April 2025, International Conference on Learning Representations)

As Large Language Models (LLMs) are increasingly deployed to handle various natural language processing (NLP) tasks, concerns regarding the potential negative societal impacts of LLM-generated content have also arisen. To evaluate the biases exhibited by LLMs, researchers have recently proposed a variety of datasets. However, existing bias evaluation efforts often focus on only a particular type of bias and employ inconsistent evaluation metrics, leading to difficulties in comparison across different datasets and LLMs. To address these limitations, we collect a variety of datasets designed for the bias evaluation of LLMs, and further propose CEB, a Compositional Evaluation Bechmark that covers different types of bias across different social groups and tasks. The curation of CEB is based on our newly proposed compositional taxonomy, which characterizes each dataset from three dimensions: bias types, social groups, and tasks. By combining the three dimensions, we develop a comprehensive evaluation strategy for the bias in LLMs. Our experiments demonstrate that the levels of bias vary across these dimensions, thereby providing guidance for the development of specific bias mitigation methods.
more » « less
Full Text Available
Langmuir Mixing Schemes Based on a Modified K‐Profile Parameterization

https://doi.org/10.1029/2024MS004729

Wang, Peng; McWilliams, James C; Yuan, Jianguo; Liang, Jun‐Hong (April 2025, Journal of Advances in Modeling Earth Systems)

Abstract Langmuir turbulence, a dominant process in the ocean surface boundary layer, drives substantial vertical mixing that influences temperature, salinity, mixed layer depth, and biogeochemical tracer distributions. While direct resolution of Langmuir turbulence in ocean and climate models remains computationally prohibitive, its effects are commonly parameterized, frequently within established turbulent mixing frameworks like the K‐profile parameterization (KPP). This study utilizes a modified KPP that determines boundary layer depth through an integral criterion, diverging from the conventional KPP's dependence on the bulk Richardson number. The modified KPP demonstrates markedly lower sensitivity to model vertical resolution than its conventional counterpart. Building upon this modified KPP framework, we introduce an innovative parameterization scheme for Langmuir mixing effects. We evaluate the performance of this new scheme against existing approaches using a one‐dimensional (1D) column model across four different scenarios, incorporating validation against both large eddy simulation (LES) results and field measurements. Our analysis reveals that the new Langmuir mixing scheme, explicitly designed for the modified KPP framework, performs competitively while maintaining reduced sensitivity to vertical resolution.
more » « less
Full Text Available
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning

Yaras, Can; Chen, Siyi; Wang, Peng; Qu, Qing (March 2025, Second Conference on Parsimony and Learning (CPAL 2025))

Multimodal learning has recently gained significant popularity, demonstrating impressive performance across various zero-shot classification tasks and a range of perceptive and generative applications. Models such as Contrastive Language–Image Pretraining (CLIP) are designed to bridge different modalities, such as images and text, by learning a shared representation space through contrastive learning. Despite their success, the working mechanisms of multimodal learning remain poorly understood. Notably, these models often exhibit a \emph{modality gap}, where different modalities occupy distinct regions within the shared representation space. In this work, we conduct an in-depth analysis of the emergence of modality gap by characterizing the gradient flow learning dynamics. Specifically, we identify the critical roles of mismatched data pairs and a learnable temperature parameter in causing and perpetuating the modality gap during training. Furthermore, our theoretical insights are validated through experiments on practical CLIP models. These findings provide principled guidance for mitigating the modality gap, including strategies such as appropriate temperature scheduling and modality swapping. Additionally, we demonstrate that closing the modality gap leads to improved performance on tasks such as image-text retrieval.
more » « less
Full Text Available
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning

Yaras, Can; Chen, Siyi; Wang, Peng; Qu, Qing (March 2025, The Second Conference on Parsimony and Learning)

Full Text Available
Applicable and generalizable machine learning for intelligent welding in automotive manufacturing

https://doi.org/10.1007/s40194-025-01951-5

Wang, Peng Edward; Ghassemi-Armaki, Hassan; Pour, Masoud; Zhao, Xijia; Ma, Junjie; Sattari, Kianoosh; Carlson, Blair (May 2025, Welding in the World)

Abstract This review paper examines the application and challenges of machine learning (ML) in intelligent welding processes within the automotive industry, focusing on resistance spot welding (RSW) and laser welding. RSW is predominant in body-in-white assembly, while laser welding is critical for electric vehicle battery packs due to its precision and compatibility with dissimilar materials. The paper categorizes ML applications into three key areas: sensing, in-process decision-making, and post-process optimization. It reviews supervised learning models for defect detection and weld quality prediction, unsupervised learning for feature extraction and data clustering, and emerging generalizable ML approaches like transfer learning and federated learning that enhance adaptability across different manufacturing conditions. Additionally, the paper highlights the limitations of current ML models, particularly regarding generalizability when moving from lab environments to real-world production, and discusses the importance of adaptive learning techniques to address dynamically changing conditions. Case studies like virtual sensing, defect detection in RSW, and optimization in laser welding illustrate practical applications. The paper concludes by identifying future research directions to improve ML adaptability and robustness in high-variability manufacturing environments, aiming to bridge the gap between experimental ML models and real-world implementation in automotive welding.
more » « less
Full Text Available
Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing

Chen, Siyi; Zhang, Huijie; Guo, Minzhe; Lu, Yifu; Wang, Peng; Qu, Qing (December 2024, Advances in Neural Information Processing Systems)

Recently, diffusion models have emerged as a powerful class of generative models. Despite their success, there is still limited understanding of their semantic spaces. This makes it challenging to achieve precise and disentangled image generation without additional training, especially in an unsupervised way. In this work, we improve the understanding of their semantic spaces from intriguing observations: among a certain range of noise levels, (1) the learned posterior mean predictor (PMP) in the diffusion model is locally linear, and (2) the singular vectors of its Jacobian lie in low-dimensional semantic subspaces. We provide a solid theoretical basis to justify the linearity and low-rankness in the PMP. These insights allow us to propose an unsupervised, single-step, training-free LOw-rank COntrollable image editing (LOCO Edit) method for precise local editing in diffusion models. LOCO Edit identified editing directions with nice properties: homogeneity, transferability, composability, and linearity. These properties of LOCO Edit benefit greatly from the low-dimensional semantic subspace. Our method can further be extended to unsupervised or text-supervised editing in various text-to-image diffusion models (T-LOCO Edit). Finally, extensive empirical experiments demonstrate the effectiveness and efficiency of LOCO Edit.
more » « less
Full Text Available

« Prev Next »

Search for: All records